New Tasks on Collections of Digitized Books

نویسندگان

  • Gabriella Kazai
  • Antoine Doucet
  • Monica Landoni
چکیده

Motivated by the plethora of book digitization projects around the world, the Initiative for the Evaluation of XML Retrieval (INEX) launched a Book Search track in 2007. The track focused on Information Retrieval (IR) tasks, exploring the utility of traditional and structured document retrieval techniques to books. In this paper, we propose four new tasks to be investigated at the Book Search track. The tasks aim to promote research in a wider context across IR, Human Computer Interaction, Digital Libraries and eBooks. We identify novel problem areas, define tasks around these and propose possible evaluation methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding information in books: characteristics of full-text searches in a collection of 10 million books

Searching large collections of digitized books is a relatively new area in information-seeking and retrieval research, made possible by initiatives such as Google Books and the HathiTrust Digital Library. The availability of large full-text book collections is transforming how users search and interact with information in books, but the characteristics of these changes are unknown. This paper a...

متن کامل

The Effect of Different Types of Classroom Tasks on Learning New Vocabulary of a New Lesson by Iranian EFL Learners; With a Focus on High School English Books: Vision 2

Learning the new vocabulary of a new lesson by high school students before starting to teachthe whole of lesson in every session is the main concern of the writer of this study as anEnglish teacher. So the purpose of the present study is introducing some practical classroomtasks (such as, working with oxford dictionary for looking up the definitions, synonyms,antonyms and examples of new words,...

متن کامل

David McKitterick. Old Books, New Technologies: The Representation, Conservation, and Transformation of Books since 1700. New York: Cambridge University Press, 2013. 286p. (ISBN: 978-1-107-03593-5). LCCN: 2012-38444

From his base at Trinity College, Cam-bridge, David McKitterick has been one of our best scholar-librarians, and his contributions to book history studies have been fundamental. He has now given us another important monograph that all special collections librarians would benefit from reading. While the title of the book is an accurate summary of its contents, it disguises McKitterick's real pur...

متن کامل

Adding New Content Types To A Large-Scale Shared Digital Repository

HathiTrust is a collaboration of universities working together to establish a repository that archives and shares their digitized collections. Initially, the Submission Information Packages (SIPs) deposited into HathiTrust were extremely uniform, being constituted primarily of books digitized by Google. HathiTrust’s ingest validation processes were correspondingly highly regular, designed to en...

متن کامل

Preservation of ebooks: from digitized to born-digital

The scope of digital curation at the BnF covers documents digitized from BnF collections as well as born-digital material bought by the BnF or collected under its legal deposit mandate. It is therefore critical for the library to investigate if common approaches may be adopted for similar document types, whatever their origin may be. This paper proposes to focus on the case of electronic books ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008